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DATE- 02/06/2002 
RAW SEQUENCE LISTING UA1 ^" 1^ n a „ 

PATENT APPLICATION : US/10/047,25 3 TIME: 19:09:22 



Input Set : A:\CIT1510-4.ST25.txt 
Output Set: N:\CRF3\02062002\J047253.raw 



ENTERED 



2 <110> APPLICANT: CALIFORNIA INSTITUTE OF TECHNOLOGY 

3 COPE, Gregory 

4 VERMA, Rati 

5 ARAVIND , L 

6 KOONIN, Eugene 

; <!=„> SS'S'iSSS&r regolatioh of target protein activity through hodjeiek 

PROTEINS 

11 <130> FILE REFERENCE: CIT1510-4 
Q^k>13 <140> CURRENT APPLICATION NUMBER: US/10/047 , 253 
A£> 13 <141> CURRENT FILING DATE : 2002-01-14 

13 <150> PRIOR APPLICATION NUMBER : US 60/261,314 

14 <151> PRIOR FILING DATE: 2001-01-12 

16 <150> PRIOR APPLICATION NUMBER: US 60/322,322 

17 <151> PRIOR FILING DATE: 200 1 - 09 - 14 

19 <150> PRIOR APPLICATION NUMBER: US 60/322,030 

20 <151> PRIOR FILING DATE: 2001-09-14 
22 <160> NUMBER OF SEQ ID NOS : 22 

24 <170> SOFTWARE: Patentln version 3.1 

26 <210> SEQ ID NO: 1 

27 <211> LENGTH: 14 

28 <212> TYPE: PRT 

29 <213> ORGANISM: Artificial sequence 
31 <220> FEATURE: 

3 2 <2 2 3> OTHER INFORMATION: JAM domain 
34 <220> FEATURE: 

3 5 <2 21> NAME/KEY: MISC_FEATURE 

36 <222> LOCATION: (1)..(14) 

37 <223> OTHER INFORMATION: Xaa is any amino acid 
39 <400> SEQUENCE: 1 v 

41 His Xaa His Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp 

42 1 5 1° 

45 <210> SEQ ID NO: 2 

46 <211> LENGTH: 17 

47 <212> TYPE: PRT 

48 <213> ORGANISM: Artificial sequence 

50 <220> FEATURE: 

51 <22 3> OTHER INFORMATION: JAM domain 

53 <220> FEATURE: 

54 <221> NAME/KEY: MISC_FEATURE 

55 ^222> LOCATION: (3).. (3) 

56 <223> OTHER INFORMATION: Xaa is Tyr or lie 

58 ^220> FEATURE: 

59 <221> NAME/KEY: MISC_FEATURE 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/0 47,2 5 3 

input Set : A:\CIT1510-4.ST2 5.txt 
Output Set: N:\CRF3\02062002\J047253.raw 

60 <:222> LOCATION: (5).. (5) 

61 <223> OTHER INFORMATION: Xaa is Ser or Thr 

63 <220> FEATURE: 

64 <22l 

65 <222 

66 <223 

6 8 -:400 > SEQUENCE: 2 



DATE: 02/06/2002 
TIME: 19 : 09 : 22 



NAME/KEY: MISC_FEATURE 
> LOCATION: (8) . . (16) 

OTHER INFORMATION: Xaa is any amino acid 



68 -:400 > SEQULNCt,: z 

'70 Gly Trp Xaa His Xaa His Pro Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Xaa 



71 1 5 10 

74 Asp 

78 <210> SEQ ID NO: 3 

79 < 211> LENGTH : 246 

80 <212> TYPE: PRT 

81 <213> ORGANISM: Homo sapiens 

83 <400> SEQUENCE: 3 , , 

85 Ihr Met lie lie Met Asp Ser Phe Ala Leu Pro Val Glu Gly Thr 

89 Thr Arg Val Asn Ala Gin Ala Ala Ala Tyr Glu Tyr Met Ala Ala Tyr 

93 lie Glu Asn Ala Lys Gin Val Gly Arg Leu Glu Asn Ala lie Gly Tr P 

9 9/ Tyr His Ser His Pro Gly Tyr Gly Cys Trp Leu Ser Gly He Asp Val 

101 Ser'Thr Gin Met Leu Asn'oln Gin Phe Gin Glu Pro Phe Val Ala Val 

III 111 lie Asp Pro Thr Arg Thr He Ser Ala Gly Lys val Asn Leu Gly 

foo Ala Phe Arg Thr Tyr Pro Lys Gly Tyr Lys Pro Pro As P Glu Gly Pro 

no ioo i° 5 



U3 Ser Glu Tyr Gin Thr lie Pro Leu Asn Lys lie Glu Asp Phe Gly Val 



m His Cys Lys Gin Tyr Tyr Ala Leu Glu Val Ser Tyr Phe Lys Ser Ser 
III Leu Asp Arg Lys Leu Leu Glu Leu Leu Tr P Asn Lys Tyr Trp Val Asn 



150 155 



125 Thr Leu Ser Ser Ser Ser Leu Leu Thr Asn Ala Asp Tyr Thr Thr Gly 



III Gin Val Phe Asp Leu Ser Glu Lys Leu Glu Gin Ser Glu Ala Gin Leu 



165 



185 



133 Gly Arg Gly Ser Phe Met Leu Gly Leu Glu Thr Hrs Asp Arg Lys Ser 
134 

137 Glu Asp w= ^ 5 22Q 

Kl Glu III He His Gly Leu Met Ser Gin Val lie Lys Asp Lys Leu Phe 



111 Glu Asp III Leu Ala Lys Ala Thr Arg Asp Ser Cys Lys Thr Thr He 



L34 195 200 205 

Leu Ala 

His Gly 

142 225 230 

145 Asn Gin He Asn He Ser 

146 245 

149 <210> SEQ ID NO: 4 

150 <211> LENGTH: 245 
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DATE: 0 2/0 (,/ 200 2 
RAW SEQUENCE LTSTING , n /nA - Ti ]9 :0«): 22 

PATENT API -i.ICA.lON: US/10/04 7,2 5 3 

input Set : A:\CIT1510-4.ST25.txt 
Output Set: N:\CRF3\O20620O2\J047253.raw 

151 <:212> TYPE: PRT 

15^ - ">n> ORGANISM : Homo sapiens 

si r»;"srsi «. - «- « - ~ set Giy sr Gly 

Val ser V,! Glu PI, v.l A3P P" Val PPe Gin PI. LyP Mel Leu PSP 
I?.! Met Leu Lys 1 Thr 01, Pr, Pro Glu Met v.l V.l Gly HP Tyr His 
IS ser His Pro 01, Pise Civ Cys Trp Pee ser Gly »1 MP .L »" 
15' Gin S. ser Phe Glu PL leu ser Glu Pr, »U v.l »» Val v.l val 
}?. Psy Pro lie Gl„ ser val Pys Gly Lys val val He Psp PI, Phe Pr, 
Leu lie Psn PI. Psn Met Met Val Leu Gly His Glu Pro Pr, Gl„ Thr 
SI Thr ser Psn Leu Gly His Leu Psn Lys Pro Ser lie Gin Pre Leu .le 
IS His Gly Leu Psn Pr, His Tyr Lyr ser He Thr lie Pse T,r Pr, L,s 
I?,! psn Glu Leu Glu Gin Lys Met Leu Leu Psn Leu His Lys Lys Ser Trp 
& Met Gin Gly Leu Thr III Gin Psp Tyr Ser Gin His Gys Lys His Psn 
HI Gin ser v.l v.l Lys Gin Met Leu «. Len PI, Lys Psn Tyr Psn Lys 
.SI Pta val Gin Gin Gin Psp Lys Met Thr Pro Gin Gl„ Len Pin He Lys 
22 PS„ V.1 Gly Lys Gin Psp Pro Lys Pr, His Len Gin Gin His v.l Psp 
-09 210 . 21 Leu Asp 

212 Val Leu Met Thr Ser Asn He Val Gin Cys Leu a ^ 

213 225 Ziu 
•16 Thr Val Val Phe Lys 

217 245 

220 <210> SEQ ID NO: 5 

221 <211> LENGTH: 421 

222 <212> TYPE: PRT 

223 <213> ORGANISM: Homo sapiens 

225 <400> SEQUENCE: 5 Glu A Val Ar g 

227 Met Pro Asp His Thr Asp Val Ser Leu Ser ^ 

rii La Leu Ser Lys Leu Gly Cys Asn lie Thr He Ser Glu Asp He Thr 
l\\ Pro Arg Arg Tyr Phe Arg Ser Gly Val Glu Met Glu Arg Met Ala Ser 
;f 9 val Tyr Leu Glu Glu Gly Asn Leu Glu Asn Ala Phe Val Leu Tyr Asn 
i\\ Lys Pne He Thr Leu Phe Val Glu Lys Leu Pro Asn His Arg Asp Tyr 
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OATK: 0 2/0 6/20 U 2 
RAW SEQUENCE LISTING TIME: 10:00:22 

PATENT APPLICATION: US/10/04 7,2.3 

^ 01. C Ma Val Pro Clu W . 01. MP ™ « Ly. Ly. «. ^ 

& 01. Ue P,e Pro M »P - - «' »" £ ^ 
»? c Tyr ... IS? 01. Tyr 01. 01. „. Leu 01. so. ,y. ». ^ Tyr 
»5 w . Ala oil no «. PVS Pys P» 01. His .1. «, - Ue 01. Ma 
260 130 "5 n Leu Glu ser clu 

263 Glu Arg Lys Arg He Ala Gin Met Arg Gi ^ 160 

264 145 I 50 B<=n . Leu Lys Lys Gin Glu Leu Ala Arg 

267 Gin Phe Leu Phe Phe Glu Asp Gin Leu Lys y ^ 

268 " 5 _ ln rln Thr ser Gly Leu Ser Glu Gin He Asp 

271 Gly Gin Met Arg Ser Gin Gin Thr ber y igQ 

272 180 Q qpr Thr His Gin Asn Asn Ser Leu Leu 

275 Gly Ser Ala Leu Ser Cys Phe Ser Thr ^ 

276 195 » " t- „q o pr asp Ala Thr Asn Tyr Ala 
' 279 Asn Val Phe Ala Asp Gin Pro Asn Lys Ser Asp ^ 

280 210 ^ ar . n T Thr p r o Ala Ala Thr Leu 

283 Ser His Ser Pro Pro Val Asn Arg Ala Leu Tnr ^ 

284 225 J30 Val Val L eu 
->87 ser Ala Val Gin Asn Leu Val Val Glu biy ^ 

288 245 Qln Leu Ala Glu Ser Asn Thr 

291 Pro Glu Asp Leu Cys His Lys Phe Leu ^ Q 

292 , 2 M riu Thr Cys Gly He Leu Cys Gly Lys Leu Thr His 

295 Val Arg Gly He Glu Thr cys o y 2g5 

296 275 u - c v"l He Val Pro Lys Gin Ser Ala Gly 

299 Asn Glu Phe Thr He Thr His Val He ^ 

300 290 he Asn Va i G ln 
30 3 Pro Asp Tyr Cys Asp Met Glu Asn Val Glu Glu ^ 
304 305 310 Gl Trp Ile His Thr His Pro Thr 

307 Asp Gin His Asp Leu Leu Thr Leu Gly i^P 335 

308 325 Qor qpr val Asp Leu His Thr His Cys Ser Tyr 

311 Gin Thr Ala Phe Leu Ser Ser Val Asp ^ 

312 340 al c ser p r o Lys His 

315 Gin Leu Met Leu Pro Glu Ala He Ala ^ 

316 T , lv Tie Phe Arg Leu Thr Asn Ala Gly Met Leu Glu Val 
319 Lys Asp Thr Gly He Phe Arg 380 

3'>0 370 u u -= Dm His Thr Lys Glu Pro Arg 

323 Ser Ala Cys Lys Lys Lys Gly Phe Hrs Pro His 4 00 

324 385 390 , L Vai Lys Asp He Lys He He 

327 Leu Phe Ser He Cys Lys His Val Leu Va^ ^ 

328 405 
3 31 Val Leu Asp Leu Arg 

332 420 

335 <210> SEQ ID NO: 6 

336 <211> LENGTH: 461 
■537 <212> TYPE: PRT 

338 <213> ORGANISM: Homo sapiens 
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DATK: 02/06/2002 
RAW SEQUENCE LISTING , n/nA . T IMF.: 19:09:22 

oAIKIIT APPLICATION: US/10/047, 253 

3 3 :° ^'U'SfSi Phe Thn V,! M. Sen Leu Lys Lys Leu Ma Jl. "en 

343 1 u I v»i ser Leu Ser Pro Glu Glu Arg Val Arg Ala 

346 Pro Asp His Thr Asp Val Ser Leu ser 3o 

347 20 , n i= n Tie Thr lie Ser Glu Asp He Thr Pro 

350 Leu Ser Lys Leu Gly Cys Asn lie Thr ^ 

351 I 5 pho Ara Ser Gly Val Glu Met Glu Arg Met Ala Ser Val 

354 Arg Arg Tyr Phe Arg Ser Giy go 

355 50 x phe Val Leu Tyr Asn Lys 
358 Tyr Leu Glu Glu Gly Asn Leu Glu Asn Ala vn 8Q 

III Phe Ile Thr Leu Phe v° al Glu Lys Leu Pro hsn His hr, hs P Tyr «• 

.1. Cys Ala val Pro .1. Lys =, hsp He Met Lys Lys Leu Ly, Glu 

S'. Ue hi, Phe Pro An, Thr Asp Glu Leu Lys Asu ASP Leu Leu Lys Lys 

371 i 1 ? el„ Tvr Gin Glu Tyr Leu Gin ser Lys Asn Lys Tyr Lys 

374 Tyr Asn Val Glu Tyr bin biu iy i4Q 

3 3 7 7 5 8 Ala G 3 i°u lie Leu Lys Lys III Glu His Gin Arg Leu He Glu Ala Glu 
3 3 7 8 9 2 Arg Lys Arg He Ala Gin Met Arg Gin Gin Gin Leu Glu Ser Glu Gin 
3 3 8 8 3 6 Phe Leu Phe Phe Glu Asp Gin Leu Lys Lys Gin Glu Leu Ala Arg Gly 

387 i 8 ° rm rin Thr Ser Gly Leu Ser Glu Gin He Asp Gly 

390 Gin Met Arg Ser Gin Gin Thr ser v, y ^ 

391 195 ok cor- Thr His Gin Asn Asn Ser Leu Leu Asn 

394 Ser Ala Leu Ser Cys Phe Ser Thr His Gin ^ 

395 210 , n III tv* ser Asp Ala Thr Asn Tyr Ala Ser 

398 Val Phe Ala Asp Gin Pro Asn Lys Ser Asp ft^ 24Q 

399 225 , Ala Leu Thr Pro Ala Ala Thr Leu Ser 

402 His Ser Pro Pro Val Asn Arg Ala Leu Thr ^ 

403 ? 45 v*i val Glu Gly Leu Arg Cys Val Val Leu Pro 

406 Ala Val Gin Asn Leu Val Val Glu GJ-Y ^ 

407 I 60 oi q Tvs Phe Leu Gin Leu Ala Glu Ser Asn Thr Val 
410 Glu Asp Leu Cys His Lys Phe Leu * ^ 

4H 275 „, Tlo ton Cvs Gly Lvs Leu Thr His Asn 

^^ Tio rin Thr Cvs Gly He Leu cys inxy ljo 

414 Arg Gly He Glu mr cys u y 3Q0 

415 290 Lvs GLn ser Ala Gly Pro 

418 Glu Phe Thr Ile Thr His Val He Val Pro ^ 32Q 

419 305 s Mot III Asn Val Glu Glu Leu Phe Asn Val Gin Asp 
422 Asp Tyr Cys Asp Met Glu Asn vai ^ 335 

III 01„ His Asp Len III Thr Len Gly Trp Ue His Thn His Pro Thr Gin 
If, Ihr u . P ne Leu ser ser v,l Asp Leu His Thn His Cys Sen Tyr Gin 
\l\ t eu „en Leu Pno Glu hi. Ue Ala He val Gys Sen Pno Lys His Lys 



435 370 
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L:13 M-.270 c 
L:13 M:271 C 
L: 41 M: 341 W: 
L: 7 0 M:341 W: 



DATE: 0 2/06/200 2 
VERIFICATION SUMMARY -]■ \ \\f\ • 19:00:2) 

PATRHT APPLICATION: US/10/04 7 , 2 5 3 liut " 

Tnout Set : A:\CIT1510-4.ST25.txt 
Output Set: N:\CRF3\02062002\J047253.raw 

m „hpr differs Replaced Current Application No 

( 46) "n" or "Xaa" used, for SEQ ID# : 1 
(46 ) »n" or "Xaa" used, for SEQ ID#:2 
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